Clustering speakers by their voices

نویسندگان

Alex Solomonoff

Angela Mielke

Michael Schmidt

Herbert Gish

چکیده

The problem of clustering speakers by their voices is addressed. With the mushrooming of available speech data from television broadcasts to voice mail, automatic systems for archive retrieval, organizing and labeling by speaker are necessary. Clustering conversations by speaker is a solution to all three of the above tasks. Another application for speaker clustering is to group utterances together for speaker adaptation in speech recognition. Metrics based on purity and completeness of clusters are introduced. Next our approach to speaker clustering is described and finally experimental results on a subset of the Switchboard corpus are presented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prosodic and Spectral iVectors for Expressive Speech Synthesis

This work presents a study on the suitability of prosodic and acoustic features, with a special focus on i-vectors, in expressive speech analysis and synthesis. For each utterance of two different databases, a laboratory recorded emotional acted speech, and an audiobook, several prosodic and acoustic features are extracted. Among them, i-vectors are built not only on the MFCC base, but also on ...

متن کامل

Perceptual scaling of voice identity: common dimensions for different vowels and speakers.

THE AIMS OF OUR STUDY WERE (1) to determine if the acoustical parameters used by normal subjects to discriminate between different speakers vary when comparisons are made between pairs of two of the same or different vowels, and if they are different for male and female voices; (2) to ask whether individual voices can reasonably be represented as points in a low-dimensional perceptual space suc...

متن کامل

Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news

The average pitch of 68 news broadcasters (34 female / 34 male speakers) was evaluated by 6 expert listeners. Additionally, the average fundamental frequency for all samples was analyzed by means of a series of standard pitch detection algorithms. The results show a strong correlation of acoustic mean and auditory median values for male voices, whereas the auditory mean values female voices are...

متن کامل

Perceptive and acoustic measurement of av male speakers in Germ

متن کامل

Building personalised synthetic voices for individuals with severe speech impairment

For individuals with severe speech impairment accurate spoken communication can be difficult and require considerable effort. Some may choose to use a voice output communication aid (or VOCA) to support their spoken communication needs. A VOCA typically takes input from the user through a keyboard or switch-based interface and produces spoken output using either synthesised or recorded speech. ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Clustering speakers by their voices

نویسندگان

چکیده

منابع مشابه

Prosodic and Spectral iVectors for Expressive Speech Synthesis

Perceptual scaling of voice identity: common dimensions for different vowels and speakers.

Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news

Perceptive and acoustic measurement of av male speakers in Germ

Building personalised synthetic voices for individuals with severe speech impairment

عنوان ژورنال:

اشتراک گذاری